Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment

نویسندگان

  • Benjamin Mears
  • Laura Sevilla-Lara
  • Erik G. Learned-Miller
چکیده

While region-based image alignment algorithms that use gradient descent can achieve sub-pixel accuracy when they converge, their convergence depends on the smoothness of the image intensity values. Image smoothness is often enforced through the use of multi-scale approaches in which images are smoothed and downsampled. Yet, these approaches typically use fixed smoothing parameters which may be appropriate for some images but not for others. Even for a particular image, the optimal smoothing parameters may depend on the magnitude of the transformation. When the transformation is large, the image should be smoothed more than when the transformation is small. Further, with gradient-based approaches, the optimal smoothing parameters may change with each iteration as the algorithm proceeds towards convergence. We address convergence issues related to the choice of smoothing parameters by deriving a Gauss-Newton gradient descent algorithm based on distribution fields (DFs) and proposing a method to dynamically select smoothing parameters at each iteration. DFs have previously been used in the context of tracking [6]. In this work we incorporate DFs into a full affine model for region-based alignment and simultaneously search over parameterized sets of geometric and photometric transforms. We use a probabilistic interpretation of DFs to select smoothing parameters at each step in the optimization and show that this results in improved convergence rates. Following the notation of Baker and Matthews [1], let T (x) be an image containing a fixed region for which we want to find the corresponding region in I(x), where x = (x,y)T is a column vector. We refer to T (x) as the template image and I(x) as the input image. Let W (x;p) be the parameterized set of warps, where p are the parameters. The goal of regionbased alignment is to find the p̂ that minimizes some distance measure between T (x) and I(W(x; p̂)). One of the early image alignment algorithms was the gradient-based Lucas-Kanade (LK) algorithm [3]. In the LK method, the feature space consists of intensity values and the similarity measure is the L2 distance. Our method is inspired by the LK algorithm, but we replace the image and template with their DF representations. The basic idea of a DF is to represent a region in an image as a normalized histogram, i.e., a probability distribution, over feature values at each pixel. In this work we use grayscale intensity values as the feature values although other features could be used instead (e.g. edge intensities, RGB values for color images, etc.). The simplest DF consists of probability distributions over binned intensity values where each probability distribution is degenerate and is given by

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

Analysis of Large-scale Displacement Using Radar Interferometry Technology in Open-pit Mines (Case Study: Gol Gohar Sirjan Mine)

  Abstract Mines with their displacements cause damage to infrastructure and land, so it is important to control the displacement caused by mines. Also, the amount of minerals extracted from mines has always been of interest to managers. On the other hand, radar interferometry, given the production of repetitive and full images and the acceptable accuracy of this method, in examining be sugge...

متن کامل

Convolutional Neural Pyramid for Image Processing

We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks. It is based on the essential finding that many applications require large receptive fields for structure understanding. But corresponding neural networks for regression either stack many layers or apply large kernels to achieve it, which is computationally very costly. O...

متن کامل

Synthesis of human facial expressions based on the distribution of elastic force applied by control points

Facial expressions play an essential role in delivering emotions. Thus facial expression synthesis gain interests in many fields such as computer vision and graphics. Facial actions are generated by contraction and relaxation of the muscles innervated by facial nerves. The combination of those muscle motions is numerous. therefore, facial expressions are often person specific. But in general, f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013